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Abstract. The existence of a phase transition with diverging susceptibility in 
batch Minority Games (MGs) is the mark of informationally efficient regimes 
and is linked to the specifics of the agents' learning rules. Here we study how the 
standard scenario is affected in a mixed population game in which agents with the 
'optimal' learning rule (i.e. the one leading to efficiency) coexist with ones whose 
adaptive dynamics is sub-optimal. Our generic finding is that any non-vanishing 
intensive fraction of optimal agents guarantees the existence of an efficient phase. 
Specifically, we calculate the dependence of the critical point on the fraction q of 
'optimal' agents focusing our analysis on three cases: MGs with market impact 
correction, grand-canonical MGs and MGs with heterogeneous comfort levels. 
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1. Introduction 

It is now well known that Minority Games (MGs) can display two types of phase 
transitions separating ergodic phases from a non-ergodic regimes [1-3]. One type of 
transition is characterized by a diverging susceptibility signalling the existence of an 
informationally efficient phase with vanishing predictability of the bid time-series [4,5]. 
The second type of transition (referred to as memory-onset transition) occurs instead 
at finite integrated response and is marked by a de Almeida-Thouless- instability and a 
replica-symmetry broken phase at non-zero predictability, similar to phase transitions 
observed in models of spin glasses [6,7]. 

From a physical viewpoint, the divergence of static and dynamic susceptibilities signals 
a sensitivity of the MG-dynamics to perturbations in the stationary state. A geometric 
interpretation of this phenomenon has been devised in [8] , and rests on the observation 
that the microscopic iV-dimensional state vector describing the system of N interacting 
agents evolves in an aA'^-dimensional space spanned by the quenched disorder of the 
problem. Here a is the key control parameter of the model. The analysis of the MG 
shows that the effective dimension of phase space is reduced to {1 — (j))N as a fraction 
(/) = (f>{a) of agents 'freezes' during the course of the dynamics: frozen agents are those 
who use just one of their strategies in the long run, so that their degrees of freedom 
are effectively removed from the dynamics. 

The breakdown of ergodicity is observed to occur at some a — ac satisfying the 
condition [1 — (j){ac)]N = acN, i.e. it occurs when the dimensionality of the reduced 
phase space becomes equal to that of the space in which the effective dynamics is 
defined. 
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The aim of the present paper is to test variations of the MG, in which the above 
geometric picture is systematically modified, for the existence or otherwise of phase 
transitions at diverging integrated response and for the presence of efficient phases 
associated with this type of transition. Such alterations of the model occur naturally 
when an additional degree of heterogeneity (besides the quenched randomness of the 
strategies) is added to the agents' learning rules, and appear to be a key ingredient of 
more realistic models of the learning of agents. They are indeed very much in the spirit 
of David Sherrington's approach to Minority Games. David suggested the addition of 
heterogeneity and complexity, and to study their effects on the phase behaviour of the 
MG in numerous discussions as well as in earlier joint articles with the authors of the 
present paper [9, 10], and it is a pleasure to submit work along this line to the special 
issue in honour of David's 65th birthday. 

Specifically, we will here consider MGs with agent-dependent impact correction, grand- 
canonical MGs [11] with heterogeneous incentives to trade, and El-Farol type games 
with heterogeneous comfort levels [12, 13]. We demonstrate that in order to observe a 
transition to an efficient phase, it is for a large class of MGs necessary, and in absence 
of memory onset transitions also siifficicnt, that the above geometric interpretation 
holds for a finite fraction of the agents. In turn, no efficient phase occurs when no 
such group of agents exists. 

2. Definitions and general remgirks on the efHcient-inefHcient transition 

Batch MGs [5] are discrete zero-temperature dynamical systems describing the coupled 
time-evolution of N agents, labelled by i = 1, . . . , in the following. Each agent, in 
the simplest setup, is described by one continuous dynamical variable qi{t) which 
evolves according to the following rules: 



a is here a finite positive control parameter while {Cfji^f} ^ire quenched random 
variables, usually drawn independently from the set { — 1.0,1} with weights 
1/4,1/2,1/4, satisfying uj'^^'^ — 0. hi{t) is an external perturbation field, used to 
measure the response of the system, and will be set to zero eventually. These equations 
can be derived from a finance-inspired setup that has been discussed at length in 
the literature (see e.g. [2]) and we shall not repeat it here in detail. In a nutshell, 
agents' choices are encoded in the Ising spins Si(t), whose possible values represent 
the two (heterogeneous) trading strategies of which every agent disposes. qi(t) is then 
a 'valuation' by which agent i assesses the performance of his strategies, so that if 
qi{t) ±cx3 asymptotically, then agent i will stick to one of his strategics {si{t) — !■ ±1) 
when t ^ oo. Otherwise, he will keep switching strategies forever. These two types of 
agents are referred to as 'frozen' and 'fickle', respectively. Note that frozen agents are 
insensitive to (small) perturbations of the dynamics in the steady state, /x denotes the 
state of the world and may take on P = aN values. The quantity Ai^ represents in 
turn the bid imbalance ('excess demand') in state /i. Efficient states are characterized 




aN 




(1) 



Si{t) = sign[qi{t)]. 
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by zero bid imbalance (on average) and zero predictability. In an efficient phase no 
statistical forecast of a bid imbalance is possible in any state fi. The predictability of 
the system is measured by the quantity H = {aN)~^ J^fi (^Im)^i with (-j/Lt) a time- 
average conditioned on the occurrence of information pattern fi. The condition H = 
implies {A\fj,) = for all fj, = 1, . . . ,aN, while H > signals the presence of an 
exploitable pocket of predictability in the time series of bid imbalances. 
The dynamic update rules ^ demonstrate that, in absence of perturbations hi{t), 
the state vector q(i) — {qi{t), . . . , qN{t)) moves in the space spanned by the aN N- 
dimensional vectors . . . A breakdown of the ability to remove dynamical 

perturbations hence occurs when the space of possible perturbations assumes a higher 
dimensionality than aN. Due to the freezing of (j){a)N agents, [1 — (j){a)]N linearly 
independent modes of perturbations can be applied. The condition for ergodicity 
hence reads 1 — </>(«) < a so that ergodicity breaking occurs at a value of a such that 
ttc = 1 - 4>{ac)- 

The relation of this type of transition with the existence of a fully efhcient phase, in 
which H — 0, can be understood as follows: as discussed above H = Q implies P = aN 
conditions, one on each {A\ii). With (1 — (f>)N effective degrees of freedom available 
the system is able to evolve into an asymptotic stationary state H = ai most when 
1 — (/) > a, i.e. in the phase where ergodicity is broken. Hence, the onset of a divergence 
of the integrated response function occurs precisely at the phase boundary separating 
a fully efficient {H = 0) non-ergodic phase from an inefficient {H > 0) ergodic one. 
It is here worth noting that, although H is not a strict Lyapunov function of the MG 
dynamics [1,2], the pseudo-Hamiltonian H is effectively minimised in the stationary 
states of the dynamics. This observation in fact allows for equilibrium approaches to 
the MG, based on the replica method. We will not pursue these in the present paper, 
however, but restrict to dynamical analyses using generating functional techniques [2] . 
The geometric picture of q moving in the space spanned by the ^'"s is violated, 
whenever the bid imbalance is agent-specific, i.e. when 

g,(t + l)=q,(i)--^f]erAf[q(t)], (2) 

where now carries an explicit index i. This is the case in MGs in which agents 
correct for their own impact on the global bid [14], in dilute MGs [15] and in El-Farol 
games with heterogeneous comfort levels [12,13]. The update rule of grand-canonical 
Minority Games (GCMGs) can also be captured by defining a suitable agent-specific 
bid, as detailed below. Indeed the susceptibility remains finite and H > for all a > 
in such games. 

One may here think of the following extreme cases: the standard MG, is fully 
independent of i, i.e one has = for all i. The opposite case corresponds to full 
heterogeneity, i.e. A'^ ^ A'^ with probability one for any i ^ j. The purpose of the 
present paper is to study intermediate cases, i.e. MG-type games in which there is 
some heterogeneity in the update rules, but where groups of extensive {0(N)) size use 
the same bid imbalance to update their scores. To this end we will in the following 
introduce a parameter < q < 1, and study models in which the above geometric 
picture hold for a group of qN agents (who all use the same bid-imbalance to update 
their score valuations), but where the remaining {l — q)N agents display heterogeneity 
in their learning rules, q therefore allows to interpolate smoothly between the MG 
without additional heterogeneity {q = 1), and the case of fully heterogeneous agents 
(9 = 0). 
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3. Minority Games with agent-specific impact correction 

We will first address MGs with so-called impact correction [6,7]. Here, agents take 
into account the effects of their own trading actions. We will here not give details of 
the interpretation as these have been discussed at length in the literature [1,2], but 
will start from 

^r[q] = ^ E + s,^^) - K + s,o ■ (3) 

We here assume that the {rn} are drawn independently from a distribution i?(?7i) at 
the beginning of the game, and that is identical for all players. 
Two well-developed techniques, adapted from spin glass physics and the theory of 
disordered systems, are available to study MGs of the type discussed in this paper. 
They rest on equilibrium and non-equilibrium approaches and are based on replica 
theory and generating functional respectively. Both approaches are discussed in 
the recent textbooks [1,2]. We will here not enter the mathematical details of the 
derivation of the resulting effective macroscopic theories. The generating functional 
analysis of MGs with heterogeneous impact correction factors leads to an ensemble of 
effective stochastic processes 

q^{t + 1) = q^{t) - aE(l + ^)(*' ^'^vi*') + ar^s^it) + V^C^(0, (4) 
v 

one for each value of 77 in the support of R{-). The relevant macroscopic order 
parameters are the correlation and response functions, and arc given by 



where 



C{t,t') = j dvR{v)Cr,it,t'), G{t,t') = j dr]R{r])Gr,it,t'), 



Cn{t,t') = {{sr,{t)sr,{t')\ v)) , Gn{t,t') = ^ll '^'"^^^ 



(5) 



(6) 



Here (( • | r/)) refers to an average over realisations of the effective process for a given 
value of r] only. The noise Cr/ has covariance 

((C.(*)C.(*')i^)) = [(I + G)-iD(I + G^)-i](t,0 (7) 
where -D(t, t') = 1 + C(t, t') for all t, t', and I is the identity matrix. We here note that 
the covariance of (^j does not depend on 77, i.e. it is identical over the entire ensemble. 
We still keep the subscript in (^i- 

The analysis then proceeds by assuming a time-translation invariant state {Cri{t, t') = 
Crj{t — t'), Grj{t, t') = G^{t — t')) at finite integrated responses 

Xn = j dT Gnir) < 00. (8) 

We will also write c,, for the persistent part of the correlation function Cj^{t — t'), and 
proceed by taking the long-time average of the effective processes: 

Qn = y/aCr, - ^ + arjSr,, (9) 

with q^i = limt^oo 9r; and s,, and Ct; the asymptotic time-averages of Sr/(i) and 
Cn{t)- Further details of these steps can be found in [2,5]. Note than x appearing in 
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the denominator on the RHS of ([5]) is x and not Xv T^^^ static noise (^^ has mean 
zero and variance 

{{Qv)) = j^ (10) 

(independently of 77), where 

^ drj R{r]) c^, X = / dri R{r]) Xn- (H) 

Defining 

viv) - ^^yj'Y^'^ (12) 

V2(l + c) 

one then finds after separating frozen and fickle agents along the lines of [2,5] 

c. = 1 + i^^erf [.(,)] - ^l^e-C)^ (13) 
for the persistent part of the correlation function C,,, and 

- .(i-,a\x)) "^[^'^^^^ ^''^ 

for the integrated response of agents with impact correction factor 77. 

Equations (|lip - (|14p fully determine the persistent order parameters c and x- We here 

note that the predictability H can be shown to be given by 

(see [1,2] for details) so that a divergence of the integrated response x signals the 
onset of a fully efficient phase, characterised by 7? = 0, as discussed in the previous 
section. 

Since the error function is bounded, one may conclude from Eq. (|14|) that Xv can 
never diverge for any r/ 7^ at finite value of a. More precisely, let us write 

R{v)^qSiv) + ii-q)P{v), (16) 

where J drjP{7f} — 1, and where P{'q) has no mass concentration (e.g. a (5-peak) at 
77 = 0, i.e. where we impose lim^^o j^g d,rjP{r]) — 0. Then using 

dvRiv) (17) 

a(l-77(l + x)) 



we have 



^ - "''^'"^"^^ ■ - q) I d,P(,)^i^^M!L. (18) 



1 + X a J a(l-'7(l + x)) 

If g = then this reduces to 

^ ^ dvP{,)^^^^^?^., (19) 



1 + X J "'a(l -r;(l + x))' 
and in the integral we always have 77 7^ (since P{vi) has no mass at zero). As a 
consequence x can never diverge, since the LHS of this equation approaches unity 
when Ixl 00, whereas the RHS tends to zero (again note the boundedness of the 
error function). We conclude that for q = there can be no x — > cx) transition, and 
hence no efficient phase. 
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Figure 1. (Colour on-line) Left: Phase diagram for -R(»?) = qS{r]) + (1 — 
9)I[-l,0]('7)- Right: H versus q for q = 0,0.1,0.25,0.5,0.75,1 (left to right). 
Lines are from theory, symbols from simulations {aN^ = 1.6 • 10®, lOOO/v'o 
batch iterations, averages over 5 samples). 



To illustrate these findings wc show the phase diagram of the model in which each rji is 
drawn from a fiat distribution P{ri) = I[-i,o] over the interval [—1,0] with probability 
1 — q, and where rji = with probabihty q for each i (I[a,h] (x) = 1 for a < x < b and 
zero otherwise). The left panel of Fig. [T] demonstrates that the integrated response 
diverges at a finite value of a = adq) for any q > 0, resulting in an efficient phase 
at a < adq)- The location of the transition ac{q) tends to zero as g ^ 0, so that 
the transition and efficient phase are absent for q = 0. In the right panel of Fig. 
[T] we depict the predictability H in dependence of a for different values of q, again 
illustrating the fact that a transition between an inefficient and an efficient phase is 
present for any q > 0, but that the phase with positive predictability persists for all 
Q > in the case q = 0. We here note that simulations at small values of a are costly 
in terms of CPU time and that numerical data at a smaller than approximately 0.02 
may hence be prone to finite-size and equilibration effects. To conclude this section 
we here note that we have explicitly excluded the case 77 > 0, as it is known that 
a memory-onset (MO) transition here precedes the transition marked by a diverging 
integrated response [6,7]. The MO transition appears to persist when heterogeneous 
distributions of the {rji} are considered and if at least a finite fraction of impact 
correction factors is positive, pre-empting the occurrence of an efficient phase. 

4. Minority Games with heterogeneous comfort levels 

We next consider MGs with heterogeneous comfort levels, sometimes also referred to 
as El-Farol games or 'clubbing' games [12, 13]. In such games agents do not apply a 
strict minority rule when updating their scores (the minority rule dictates that a good 
move is to make a bid of the opposite sign of the total bid in the market), but apply 
an individual comfort level A^. Thus the effective bid Ai agent i uses to update his 



On the transition to efficiency in Minority Games 
strategy valuations is Ai{t) = A{t) — Aj, and we have 

q,{t + 1) = q,it) - ^ E C ^ E {^"^.-(i) + ^j} - >^ 



+ hi{t). (20) 



Similar to the case of market impact correction we assume that the {Ai} are drawn 
independently from a distribution R{\i) at the beginning of the game. 
The efficient phase here corresponds to a regime in which the population of agents 
is able to co-ordinate their overall behaviour so that the mean total attendance {A) 
corresponds (on average) to the mean comfort level. 

The generating functional analysis again results in an ensemble of effective agent 
processes, one for each comfort level 

qx{t + 1) = qx{t) - a E(I + G)«'sA(i') + ^Tlx{t) + hx{t). (21) 
t'<t 

sx{t) is given by s\{t) = sgn[q\{t)] and r]x{t) is Gaussian noise of zero mean, and with 

temporal correlations 

{{Vxit)vxit')\X)) - [(I + G)~^D{I + G^)-%, + 2ftft, - 2X{,ft + Jv) + 2\^Eu{22) 

Here lu' = Su' is the identity matrix and Eu> = 1 ^t,t' , G and G are the correlation 
and response functions of the system respectively , and we find an additional dynamical 
order parameter G': 



G,,= >i„A.-.i:A.fff. (23) 

The matrix D is, as before, given by Da' = 1 + Ctt' for all t, f and we have 

/, = E[(I + G)-^G')W. (24) 
t' 

These order parameters are then to be determined as averages over realisations of the 
effective processes and over the distribution of A 

Ctt' = J dXR{X) {{sx{t)sx{t')\X)) (25) 
Gtt'= ldXR{X)^j^^{{s,m)) (26) 

G'tt,= j dXR{X)X^j^^{{sx{t)\X)) (27) 

(where ((• • • |A)) is an average over realisations of the effective process restricted to 
representative agents with comfort level A). 

Assuming time-translation invariance (i.e. Gtf = C{t — t') and similarly for Gtt and 
Gj(,) and finite integrated response one follows the standard ansatz to proceed from 
the effective agent problem to explicit equations characterising the relevant persistent 
order parameters of the ergodic stationary states. In our problem these are given 
by c, the persistent part of the correlation function, and by % = G{t) and 
x' = E.G'(r). 
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The resulting 3x3 system of non-linear equations for {c, x, x'} then reads 

dAi?(A)cA, x= fdXRWxx, x' = I d\ R{\)\ XX, (28) 



with i?(A) the distribution from which the comfort levels {A,} are drawn and where 
X\ and Ca given by 

XX = erf w(A ) , 

a 

where 



with g{X) is the persistent part of the temporal correlations of the noise r]x{t) in the 
effective agent problem: 

^i^) = JlTW^^'i^Tx?-''^x^'''■ '''' 

The mean attendance level comes out as 

(A) = ^. (32) 

We next perform an analysis similar to the one of the MGs with impact corrections, 
and look for possible divergences of % and x'- One has 



X 



a-^ J dXR{X)ei{ {w{X)) (33) 
a-^ j dXR{X)Xerf {w{X)) (34) 



1 + X 

and 

X' 

1 + X 

For X ~^ oo, we find from the former relation 

1 = lim / dXR{X)eii {w{X)) , (35) 

i.e. not all w{X) can go to zero as x — > oo. If this happened for all A for which -R(A) 
has mass, then the RHS would go to zero. 

Thus, in order for a transition at diverging integrated response to occur, the mass of 
all A for which lim^^-xx) |fi'(A)(l + x)^l < must be positive. Now, we have 

5(A) (1 + x)' = 1 + c + 2x'' - 4Ax'(l + x) + 2A2(1 + xf- (36) 

Assuming reasonably that x'/x ^ 7 as x ^ oo, with 7 finite, one finds that 
£((A)(1 + x)^ remains finite, if and only if 

7^ + A^ - 2A7 = (37) 

i.e. if A = 7. In other words, if x'/x ^ 7 as the susceptibilities diverge, then the 
transition can only exist if R{X) has positive mass concentration (delta-peak) at A = 7. 
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Figure 2. (Colour on-line) Left: Phase diagram for -R(A) = q5{X — 1/2) + (1 — 
g)I[Q 1] (A). Right: Deviation of the mean overall bid (A) from the mean comfort 
level A = 0.5 versus a for 5 = 0, 0.1, 0.25, 0.5, 0.75, 1 (left to right). Lines are from 
theory, symbols from simulations {aN'^ = 1.6 ■ 10^, lOOO/v^Q batch iterations, 
averages over at least 5 samples). 



Let us take for example a symmetric distribution of A around a value A. Then one 
has i?(A — A) = i?(A + A). Now x' — is then a self-consistent solution at the 
transition, because 

and g{A + A) = ^(A — A) for diverging x, x' and x'/x ~* ^■ 

Thus if one has a symmetric distribution of A around a value A, then the x ^ co 
transition can exist only if R{X) has a delta-peak at its mean A (plus other symmetric 
contributions). This is confirmed in Fig. [2] where we depict the resulting phase 
diagram for R{X) ~ qS{X — 1/2) + (1 — q)I[o,i](A) at different values of q, and where 
a phase in which the mean comfort level A = 0.5 is retrieved (| (A) — A| = 0) is found 
for any g > 0, but where such a phase is absent at g = 0. 



5. Grand-canonical Minority Game 

Here we consider the case of grand-canonical MGs [11] with heterogeneous incentives. 
In GCMGs agents hold only one active strategy, but have the option to abstain from 
taking any action at any given time step. The batch upate rule for the scores of active 
strategies can then be written in the form [16] 

q^{t + l) = q,{t) - j^^^at^^^jnjit) " "^i^ (39) 
i M 

where the {af} encode the active strategies of players, and where the {ci} are 

incentives of agents not to trade. ni{t) is given by ni{t) — Q[qi{t)\. One realises 
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that the geometric interpretation of the score-vector moving in the space spanned by 
the disorder holds when = 0, and in this case a transition between an efficient and a 
non-efficient regime is indeed observed [1,2,11]. The generating functional analysis for 
the GCMG is detailed in [2,16]. The representative process for agents with incentive 
e reads 

qeit + 1) = - a ^(I + G)(t, t')n,it') - ae + v^z(t), (40) 
t'<t 

where nf{t) — Q[q^{t)] and 

{z{t)z{t')) = [{I + G)-'C{l + G^)-']{t,t') (41) 

G{t,t')^ J deR{e)C,{t,t') = j deR{e) {n,{t)n,{t')) (42) 

G{tX) = j deR{e)GS,t') = ^/ ^^^^^^^ ^^^^ 

(•) here denotes an average over the static noise z. In the ergodic steady state, the 
effective noise z{t) becomes a static random variable with variance (z^) = c/{l + x)^, 
withc = j deR{e)c^ and x = / deR{e)xe- Following [16] and introducing the shorthand 
7 = y/a/{l + x), the persistent autocorrelation and susceptibility at fixed e are found 
to be given by 

c, = (eiz - 7 - eV^)) + ^ ^^"^/"^% (7 + eV^- z)9{z - e^^)^ (44) 
Xe = -^{0{l + eV^~z)e{z^e^)) (45) 



The above equations can be solved numerically, and in turn c and x can be computed 
for any distribution of incentives i?(e). As is customary, we divide the N agents in 
two groups: Np producers with e = —oo (for whom c_oo = 1 and X-oo — 0) and Ng 
speculators with finite e. We further fix = Np/P — 1 and use Ug — Ng/P as the 
control parameter in place of a, which is now given by a = l/(ns -|- n^). For the 
sake of simplicity, we work out the theory in detail only for the case where R{e) is 
symmetric around e = 0, namely 

R{e) ^ q6{e) + ^ [S{e e) + S{e + e)] (46) 

with e a constant. The standard case corresponds to g = 1 and displays a transition 
with diverging x from an efficient (H = 0) to a non efficient {H > 0) regime when rig 
decreases below the critical value n* ~ 4.15. 

In Fig. [3] we compare analytical predictions for different q and e = 0.25. As 
expected, the phase diagram shows that the efficient phase shrinks as q increases 
and, accordingly, the predictability vanishes. 

Notice that the situation improves if an asymmetric distribution R{e) is considered. 
Indeed the dashed green line in Fig. [3] represents the boundary between the efficient 
and the inefficient phase when R{e) — q6{e) -I- (1 — q)6{e — 1), showing that even for 
very small but non-zero q agents may wash out predictability from the time series of 
bid imbalances. (Still, however, when q = no efficient phase occurs.) 
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Figure 3. (Colour on-line) Left panel: phase diagram of the grand-canonical 
MG with heterogeneous incentives. The continuous line corresponds to the case 
where R(e) is as in 1 146 I I. the dotted horizontal line marks the transition point n* 
of the original model. Right panel: H vs rig for q = 1,0.8,0.6,0.4,0.2 (left to 
right). Markers correspond to simulations with fixed PNs = 32000, averages over 
100 disorder samples. 



6. Conclusions 

To summarize, we have studied the phase structure of batch MGs with mixed 
populations of optimally and sub-optimally learning agents, probing the robustness of 
the efficient phase against modifications of the agents' learning rules in the direction 
of increasing heterogeneity. Regimes with zero predictability turn out to survive for 
any finite fraction of optimal agents. It would be interesting to know whether altering 
the information structure, that is taken to be fixed throughout the models discussed 
here, can modify this picture. For instance, a single agent (or a finite group of agents) 
with access to a more informative signal may alter this scenario or manage to take 
advantage from it. This direction is to our knowledge unexplored so far, with the 
possible exceptions of [17, 18] and some studies along these lines would be in our 
opinion worthwhile. 
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